Optimization of GEMV on Intel AVX Processor

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ELZAR: Triple Modular Redundancy using Intel AVX

Instruction-Level Redundancy (ILR) is a well known approach to tolerate transient CPU faults. It replicates instructions in a program and inserts periodic checks to detect and correct CPU faults using majority voting, which essentially requires three copies of each instruction and leads to high performance overheads. As SIMD technology can operate simultaneously on several copies of the data, i...

متن کامل

Fast Sorting Algorithms using AVX-512 on Intel Knights Landing

The modern CPU’s design, which is composed of hierarchical memory and SIMD/vectorization capability, governs the potential for algorithms to be transformed into efficient implementations. The release of the AVX-512 changed things radically, and motivated us to search for an efficient sorting algorithm that can take advantage of it. In this paper, we describe the best strategy we have found, whi...

متن کامل

Explorer ELZAR : Triple Modular Redundancy Using Intel AVX ( Practical Experience

Instruction-Level Redundancy (ILR) is a well known approach to tolerate transient CPU faults. It replicates instructions in a program and inserts periodic checks to detect and correct CPU faults using majority voting, which essentially requires three copies of each instruction and leads to high performance overheads. As SIMD technology can operate simultaneously on several copies of the data, i...

متن کامل

A Novel Hybrid Quicksort Algorithm Vectorized using AVX-512 on Intel Skylake

The modern CPU’s design, which is composed of hierarchical memory and SIMD/vectorization capability, governs the potential for algorithms to be transformed into efficient implementations. The release of the AVX-512 changed things radically, and motivated us to search for an efficient sorting algorithm that can take advantage of it. In this paper, we describe the best strategy we have found, whi...

متن کامل

: Optimization for the Intel ®

a s t a c k f r a m e c a n b e d y n a m i c a l l y c h a n g e d a t a n y t i m e. A s t a c k f r a m e b e g i n s a t a p o s i t i o n m a r k e d b y t h e " C u r r e n t F r a m e M a r k e r " (C F M) , a n d e n d s a t t h e " T o p O f S t a c k " o c a t e d a t t h e b o t t o m o f t h e s t a c k , a n d n o t r a n d o m l y s c a t t e r e d a l l o v e r t h e f r a m e , ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Database Theory and Application

سال: 2016

ISSN: 2005-4270,2005-4270

DOI: 10.14257/ijdta.2016.9.2.06